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ABSTRACT 

We introduce a framework for simultaneously investigating the structure and lumi- 
nosity evolution of early-type gravitational lens galaxies. The method is based on the 
fundamental plane, which we interpret using the aperture mass-radius relations derived 
from lensed image geometries. We apply this method to our previous sample of 22 lens 
galaxies with measured redshifts and excellent photometry. Modeling the population 
with a single mass profile and evolutionary history, we find that early-type galaxies are 
nearly isothermal (logarithmic density slope n = 2.06 =b 0.17, 68% C.L.), and that their 
stars evolve at a rate of d\og{M / L) b / dz = —0.50 ± 0.19 (68% C.L.) in the rest frame 
B band. For a Salpeter initial mass function and a concordance cosmology, this implies 
a mean star formation redshift of {zf) > 1.5 at 95% confidence. While this model can 
neatly describe the mean properties of early- type galaxies, it is clear that the scatter of 
the lens sample is too large to be explained by observational uncertainties alone. We 
therefore consider statistical models in which the galaxy population is described by a 
distribution of star formation redshifts. We find that stars must form over a significant 
range of redshifts (Azj > 1.7, 68% C.L.), which can extend as low as 2;^ ~ 1 for some 
acceptable models. However, the typical galaxy will still have an old stellar population 
{{zf) > 1.5). The lens sample therefore favors early star formation in field ellipticals - 
even if we make no a priori assumption regarding the shape of the mass distribution in 
lenses, and include the range of possible deviations from homology in the uncertainties. 
Our evolution results call into question several recent claims that early-type galaxies in 
low-density environments have much younger stars than those in rich clusters. 



Subject headings: galaxies: elliptical and lenticular, cD - galaxies: evolution - galaxies: 
structure - gravitational lensing 
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1. Introduction 

Mapping the history of early- type (elhptical and lenticular; E/SO) galaxies is essential for 
testing the modern picture of galaxy formation. First, early-type galaxies are believed to result 
from the mergers of later- type disk galaxies (e.g., White & Rees 1978; Davis et al. 1985; Kauffmann, 
White & Guiderdoni 1993; Cole et al. 1994; Kauffmann 1996), and therefore provide a unique 
probe of hierarchical clustering in the cold dark matter (CDM) paradigm. Second, the evolution of 
their stellar populations places constraints on the most recent epoch of star formation, and hence, 
when mergers may have occurred. Semi-analytic CDM models (e.g., Baugh, Cole &: Frenk 1996; 
Kauffmann 1996; Kauffman &; Chariot 1998; Diaferio et al. 2001) have investigated the evolutionary 
history of early-type galaxies, and find significant environmental dependency. Specifically, they 
indicate that E/SOs in high-density environments such as clusters should have significantly older 
stellar populations than field ellipticals, many of which should have formed in recent mergers. 
Current surveys are now able to directly test these predictions by tracing the evolution of early- 
type galaxies in various environments out to z ~ 1. 

Ellipticals in rich clusters have been the most extensively studied, with samples now approach- 
ing z ~ 1.3 (van Dokkum & Stanford 2003). It is therefore not surprising that a broad consensus 
has been reached regarding their properties. The evolution in mass-to-light ratio inferred from 
the fundamental plane (FP) strongly indicates an early mean star formation epoch {{zf) > 2). 
For instance, the sample of van Dokkum et al. (1998) yields an evolution rate in the B band of 
d\og{M / L) B / dz = —0.49 ± 0.05 to z = 0.83, corresponding to a preferred formation redshift of 
[zf) ~ 2.5. The subsample sX z ^ 1.3 is consistent with an extrapolation of this result. Cluster 
galaxies are likely to experience significant morphological evolution, and the mergers in MS 1054- 
03 (van Dokkum et al. 1999, 2000) appear to be direct evidence for the hierarchical clustering 
model. Morphological evolution can lead to a "progenitor bias", by which the early-type galaxy 
population will appear systematically younger than its true age (Kauffmann 1996; van Dokkum 
&: Franx 2001). However, applying the maximal progenitor bias only changes the evolution rate 
to d\og{M/L)B/dz = -0.56 ± 0.05 {{zf) ~ 2; van Dokkum & Franx 2001). Because these and 
other evolution results (van Dokkum k. Franx 1996; Jorgensen, Franx & Kjaergaard 1996; Jor- 
gensen et al. 1999; Kelson et al. 1997, 2000; Wuyts et al. 2004) are strongly supported by color and 
spectral analyses (Bower, Lucey k, Ellis 1992; Ellis et al. 1997; Stanford, Eisenhardt & Dickinson 
1998; Ferreras, Chariot k. Silk 1999), there is little evidence to indicate that early-type galaxies in 
present-day clusters are anything but old, red and dead. 

The history of early-type galaxies in low-density environments is less well understood, but 
progress is being made. Many recent results based on analyses of the fundamental plane yield best- 
fit evolution rates of —0.8 < d\og{M / L) b / dz < —0.5, and are mutually consistent within their 
quoted uncertainties. For example, Treu et al. (2001, 2002) find d\og{M / L) b / dz = -0.72l|j;Jg, 
van Dokkum et al. (2001) and van Dokkum & Ellis (2003) find d\og{M/L)B/dz = -0.54±0.07, and 
van der Wei et al. (2004) find dlog{M/L)B/dz = —0.71 =b 0.20, each based on luminosity-selected 
samples out to z ~ 1. In addition, Rusin et al. (2003; hereafter R03) analyze a mass-selected sample 
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of gravitational lens galaxies over the same redshift range and extract a similar evolution rate of 
d\og{M / L) B / dz = —0.54 it 0.09, a result that is largely confirmed by van de Ven, van Dokkum &: 
Franx (2003; hereafter vvF). The only significant outlier among FP studies is the Gebhardt et al. 
(2003) measurement of d\og{M / L) b / dz ~ —1.0 to z = 1 (see also Im et al. 2002). 

Despite the broad consistency of the above evolution rates, there is some disagreement re- 
garding the implied star formation history of early-type field galaxies. For example, Treu et al. 
(2001, 2002) and Gebhardt et al. (2003) favor a mean star formation redshift of {zf) < 1.5, which, 
combined with cluster results, would appear to confirm the CDM prediction of younger stellar 
populations in field E/SOs. The presence of young stars is supported by spectral (Schade et al. 
1999; Kuntschner et al. 2002; Treu et al. 2002), color (Menanteau, Abraham & Ellis 2001; vvF) 
and evolutionary (van der Wei et al. 2004) evidence for some recent star formation. In contrast, 
the remaining FP analyses claim a mean star formation redshift of {zf) ~ 2, and little or no age 
difference between field and cluster populations. These results are supported by the spectral study 
of Bernardi et al. (1998) and the color analysis of Bell et al. (2004). Clearly more work is required 
to properly interpret current samples of early-type field galaxies, and account for the possibility 
that stars form over a range of redshifts. 

The focus of this paper is the gravitational lens sample. Lenses are uniquely suited to ex- 
ploring the luminosity evolution of the early-type galaxy population in low-density environments 
(e.g., Keeton, Kochanek & Falco 1998; Kochanek et al. 2000; R03). First, lenses are mass-selected 
galaxies, and are therefore less susceptible to Malmquist biases which will favor the inclusion of 
brighter, bluer and younger galaxies in luminosity-selected samples. Second, because the lensing 
cross section scales strongly with velocity dispersion, almost all lens galaxies are of early morpho- 
logical types. Third, the optical depth is dominated by galaxies in low-density environments, such 
as the "field" and poor groups (Keeton, Christlein & Zabludoff 2000). Fourth, lensing naturally 
selects a galaxy sample out to z ^ 1, due to the redshift spread in the source population and 
competing volume and cross section effects in the optical depth. Fifth, lenses constrain projected 
masses, which can be used to normalize the galaxy population to a common scale. 

Galaxy evolution results based on gravitational lens samples are often viewed with skepticism, 
as these investigations (Kochanek et al. 2000; R03; vvF) do not use measured velocity dispersions 
in their FP analyses. There is a legitimate reason for this: with a handful of exceptions (Foltz et al. 
1992; Lehar et al. 1996; Tonry 1998; Ohyama et al. 2002; Koopmans & Treu 2002, 2003; Koopmans 
et al. 2003; Gebhardt et al. 2003; Treu & Koopmans 2004), the velocity dispersions of lens galaxies 
have not been measured. However, the geometry of the lensed images allows the velocity dispersion 
to be estimated very accurately, given a model for the mass distribution. An isothermal {p oc r^^) 
model is the common choice, as it is consistent with most constraints on lens galaxies (Kochanek 
1995; Cohn et al. 2001; Muhoz, Kochanek & Keeton 2001; Treu & Koopmans 2002a; Rusin et al. 
2002; Koopmans & Treu 2003; Winn, Rusin & Kochanek 2003; Rusin, Kochanek & Keeton 2003; 
Koopmans et al. 2003; Treu & Koopmans 2004) and local ellipticals (Fabbiano 1989; Rix et al. 1997; 
Gerhard et al. 2001; see Romanowsky et al. 2003 for an alternative conclusion). In the context 
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of evolution studies, the issue is then whether the isothermal assumption is robust and sufficient. 
Criticisms of R03 (Treu & Koopmans 2002b; Gcbhardt ct al. 2003) have centered on claims that 
velocity dispersion measurements of a few specific lens galaxies require mass profiles which are 
different than isothermal. The implication is that the results of R03 and similar papers can be 
discarded on the grounds that potential systematic and statistical deviations from the isothermal 
assumption may significantly affect evolution constraints, but have not been properly taken into 
account. We must therefore determine how constraints on the luminosity evolution model depend 
on galaxy structure, and the observationally permitted range of mass profiles. 

The primary goal of this paper is to develop and apply a framework for simultaneously in- 
vestigating the structure and evolution of gravitational lens galaxies. In this way we can consider 
the range of mass profiles allowed by the ensemble of lens geometries, and its effect on stellar 
population constraints. In essence, we derive limits on luminosity evolution without any a priori 
assumption regarding the mean mass distribution in early-type galaxies. The theoretical frame- 
work is built upon the self-similar, two-component mass models introduced and analyzed by Rusin, 
Kochanck & Kccton (2003; hereafter RKK). We show that the formalism is very closely related 
to the fundamental plane, and is therefore a powerful tool for tracing luminosity evolution. The 
results of our combined structure/evolution analysis bolster the previous claims of R03 that older 
stellar populations are strongly favored for field E/SOs. 

A secondary goal of this paper is to better quantify scatter within the lens galaxy sample, 
and among early-type field galaxies in general. Unlike cluster ellipticals, which typically show 
little galaxy-to-galaxy age variance, field ellipticals exhibit significant scatter about their best-fit 
"mean" evolutionary tracks (e.g., Treu et al. 2001, 2002; RKK; wF, Treu & Koopmans 2004; van 
der Wei et al. 2004), strongly suggesting a range of stellar ages at z ~ 0. Therefore, in addition 
to investigating single-epoch star formation models to describe the mean evolutionary history of 
early-type field galaxies, we also consider a simple model in which galaxies form over a range of 
rcdshifts. In doing so, we obtain the first stellar evolution model which reproduces the statistical 
properties of the lens sample. 

Section 2 outlines the theoretical model for investigating galaxy structure and evolution. Sec- 
tion 3 details our lens sample and calculations. Section 4 describes our results. Section 5 compares 
our findings with other lensing analyses. In §6 we summarize our results and discuss their implica- 
tions. We assume JIm = 0.3, ^\ = 0.7 and Hq = 65kms~^ Mpc~^ for all calculations. 

2. Structure and Evolution in a Common Framework 

2.1. General Theory 

The geometry of a gravitational lens system yields a precise, model-independent relationship 
between the aperture radii defined by the lensed images and the projected masses they enclose 
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(Schneider, Ehlers & Falco 1992). For rings and four-image lenses (quads), the Einstein radius 
REin is related to the aperture mass Meiu'- 



7ri?|.„ AirGDaDds ~ " ' 

Note that modeling can remove the effects of galaxy ellipticity and external shear in quads and 
rings, leaving a neghgible (~ 2%) uncertainty on eq. (1). For two-image lenses (doubles), the image 
radii R\ and R2 are related to the enclosed masses M(i?i) and M{R2): 



1 

TT 



M(i?i) M(i^2) 
Ri R2 



R1+R2 



Scr • (2) 



Modeling cannot robustly remove the effects of galaxy ellipticity and external shear in doubles. 
However, we can account for typical quadrupoles by assuming a ~ 10% uncertainty on eq. (2). 

The quantity Ecr, defined in eq. (1), is the critical surface density, which depends on the angular 
diameter distance to the lens (Dfi), to the source (Ds), and from the lens to the source (Dds). RKK 
demonstrate that an ensemble of aperture mass-radius (AMR) relations can be used to statistically 
constrain the mean mass profile of lens galaxies, within the context of a model which relates the 
mass distribution to the light. 

Once a given mass profile has been normalized to satisfy the aperture mass-radius relation in 
a particular lens, we can calculate the line-of-sight stellar velocity dispersion at radius R by solving 
the Jeans equation. For constant orbital anisotropy Piso = ^ — (^"e/^r^ solution is 



/W.^(H) = 2£d.(l-ft,„^).-^.-/~*«M„=^... , (3) 



where I{R) is the surface brightness and v{r) is the corresponding volume distribution of the 
luminous matter. The luminosity-weighted velocity dispersion inside some (circular) spectroscopic 
aperture R is then: 

(a2(i?)) = ^ (fR'l{R')a\R') j ^ (fR'l{R') . (4) 

Note that galaxy ellipticity and external shear do not significantly affect the dynamical estimate 
(Kochanek 1994). The mass normalization required by the AMR relation makes the stellar velocity 
dispersion a very sensitive probe of the radial mass profile in lens galaxies (e.g., Treu & Koopmans 
2002a; Koopmans & Treu 2003). 

Our ability to estimate the velocity dispersion from the image geometry means that we can 
investigate the luminosity evolution of lenses using the fundamental plane (FP; Djorgovski &; Davis 
1987; Dressier et al. 1987): 

log Re = a log (7c + P{SBe) + 7 , (5) 
where Re is the effective (half-light) radius, (SBe) is the mean absolute surface brightness enclosed 
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by Re, and cTc is the central stellar velocity dispersion. The FP allows us to predict the surface 

brightness that a galaxy would have at z = 0, given its velocity dispersion and effective radius. 
Assuming that the structural parameters do not evolve (the same assumption which has been made 
in all previous evolution analyses of the FP; e.g., Jorgensen et al. 1996, 1999; Kochanek et al. 2000; 
Treu et al. 2001, 2002; van Dokkum et al. 1998, 2001; R03; vvF; Gebhardt et al. 2003), the difference 
between the observed surface brightness and the predicted z = value is then directly related to 
the luminosity evolution: Alog(M/L) = 0.4A(S'Se) = — O.4A7//?. This technique estimating 
velocity dispersions from mass models normalized to match the AMR relations and plugging the 
values into the FP - has been employed by previous studies to trace the luminosity evolution of 
lens galaxy samples (Kochanek et al. 2000; R03; vvF). The drawback of these analyses is that they 
each focussed on the isothermal mass model, and did not thoroughly consider possible deviations 
or the range of permitted profiles.^ 

Our goal is to investigate galaxy evolution without any a priori assumption regarding the 
shape of the mass distribution. Of course, we cannot independently set the profile of each lens 
galaxy. Instead, we assume that early-type galaxies form a regular population, in which the mass 
distribution is related to the light distribution. We can then use the FP to simultaneously investigate 
galaxy structure and evolution. Specifically, for a given mass profile, the central velocity dispersion 
can be calculated for each lens once the profile normalization has been set by the AMR constraint. 
These velocity dispersions can then be compared with the predictions of the FP/evolution model: 



where 70 is the present-day intercept, and e{z) is the evolution in mass-to-light ratio to the galaxy 
redshift z. For now we assume that all galaxies are described by a single evolutionary track; more 
general models will be considered later. The model may then be evaluated by finding the mass 
profile, evolution model and zero-point that minimize the scatter of the FP, which is known to be 
small at both high and low redshift, and in all environments (e.g., van Dokkum & Franx 1996; 
Jorgensen et al. 1996; Kelson et al. 1997; van Dokkum et al. 1998, 2001; Pahre, Djorgovski, & de 
Carvalho 1998a; Bernardi et al. 2003). 

The FP technique described above is very closely related to the homology formalism of RKK. 
For a homologous mass distribution, M = cm cr^ Re/G, where cm is a constant. Substituting the 
definitions of the surface brightness (L oc leRe-, [SB^) = — 2.51og/e) and implicitly evolving all 
luminosities to present-day, we have 



^Thc quantity ac is often defined within a standard aperture of 3'.'4 (diameter) at the distance of the Coma cluster 
(e.g., Jorgensen et al. 1996), or some fixed fraction of the effective radius (typically -Re/8). 

^Note that Kochanek et al. (2000) experimented with models in which mass traces light, and found little difference 
in the favored evolution rate. 



log GFP = 



logi?e-/3[(5Be)-2.5 e(z)]-7o 



(6) 
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If we allow the mass-tolight ratio to scale as = [I^I^Y , then 




2x+l ' 



(8) 



which looks like the fundamental plane (eq. 5), with a = 2/(2x + 1) and {3 = 0.4(x + l)/(2x + 1) 
(see also Faber et al. 1987). 

The RKK homology formalism (with luminosity evolution) is identical to the real FP under 
three reasonable conditions. First, the FP slopes must be consistent with the definition of x, which 
requires that q;//3 = 5/(x + 1). Note that this condition is met by measurements of the local FP 
in the B band, which favor slopes (1.20 a < 1.25 and [3 ~ 0.32; Jorgensen et al. 1996; Bender et 
al. 1998) that are very close to those predicted for x = 0.3. Second, we must fit the local intercept 
7o, which depends on a combination of cm and the stellar mass-to-light ratio. Fitting for 70 is 
a reasonable procedure, as local measurements of the FP intercept in low-density environments 
are scarce. Third, the velocity dispersion must be measured/estimated in a spectroscopic aperture 
which is some fixed fraction of Re- Such a definition is already used in many FP analyses (e.g., 
Jorgensen et al. 1996, 1999; van Dokkum et al. 1998, 2001; Bcrnardi et al. 2003). Note that the 
size of the spectroscopic aperture relative to is irrelevant if the intercept is fit, as it changes 
only the dynamical constant cm, which has no effect on evolution rates unless the homology breaks 
down significantly or evolves with redshift. The value of Piso used to estimate velocity dispersions 
is also irrelevant for our analysis, as it can be incorporated into cm- We make the assumption 
of constant l3iso for computational simplicity. However, the anisotropy is equally irrelevant for an 
Osipkov-Merritt (Osipkov 1979; Merritt 1985a,b) parameterization [f3iso{r) = r"^ / {r'^ + rf)], so long 
as we assume that the isotropy radius rj is proportional to the effective radius. 

Consequently, the FP and homology frameworks are interchangeable, with the AMR relations 
derived from image geometries acting as a suitable proxy for velocity dispersions. By incorporating 
stellar evolution explicitly into the homologous models for galaxy mass distributions, we can harness 
the power of the FP to simultaneously constrain the structure and luminosity evolution of early- 
type lens galaxies. The combined analysis allows us to obtain limits on luminosity evolution that 
are relatively unbiased with respect to the choice of mass model. A certain range of mass profiles 
will be consistent with a tight fundamental plane, and this range dictates the degree to which 
a strong homology might be violated in this galaxy population. The effect of this range on the 
evolution estimates can then be taken into account. 



We begin with a two-component model relating the mass and light in galaxies. The model is 
parameterizied in terms of projected quantities, as these are directly constrained by gravitational 
lenses. However, volume mass distributions are necessary for evaluating the Jeans equation (eq. 3). 
We therefore employ three-dimensional analogs of the models used by RKK. 



2.2. 



Mass Models 
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The mass component which traces the stars is modeled as a Hernquist (1990) profile, with 
volume density 

^'""^^""^ = {r/a){l+r/af ' 

where a is the characteristic radius. In projection, the Hernquist model closely approximates a 
de Vaucouleurs surface brightness profile for a = 0.551-Re- The fraction of the total luminosity 
projected inside R is denoted g{R/Re)- Allowing the mass-to-light ratio of the stars to vary with 
galaxy luminosity (T oc L^), the projected luminous matter inside an aperture R is then 



MiumiR) = T,L,i^-^j gi^—j , (10) 

where quantities are scaled relative to those of an L* galaxy, which we define as having a present- 
day {z = 0) characteristic magnitude of M^{0) = —19.9 -|- 51og/i in the rest frame B band (e.g., 
Madgwick et al. 2002). The quantity Lg^ is the galaxy luminosity corrected for stellar evolution, 
which allows all lenses to be evaluated at a common age. (see §2.3). 

The dark matter halo is modeled using the cuspy profile of Munoz et al. (2001), with volume 
density 

where rf, is the break radius. The profile follows the generalized form of a simulated CDM halo 
(Navarro, Frenk & White 1997; Moore et al. 1999), which has a logarithmic density slope n for 
r -C Tfe and 3 for r ^ ri,. In most cases we will consider the "power-law" limit with rh S> Re- The 
shape of the projected CDM mass profile is denoted as mcdm{R/Re), which we normalize such that 
mc(im(2) = 1- The abundance of CDM is described by fcdrm the fraction of aperture mass in the 
form of dark matter inside R = 2Re. The total (luminous plus dark) aperture mass enclosed by R 
is then 



M{R) = 
with g{2) = 0.69. 



(R \ , , fcdm f R 

R" + 9{^h 7 '^<^rn I -B- 



(12) 



While our mass model is based on the assumption of strong homology, determining the degree 
to which early- type galaxies form a homologous population remains a problem of great interest 
(e.g., Faber et al. 1987; Caon, Capaccioh & D'Onofrio 1993; Bertin et al. 1994, 2000; van Albada, 
Bertin, & StiavcUi 1995; Ciotti, Lanzoni & Rcnzini 1996; Pahre, dc Carvalho & Djorgovski 1998b; 
Gerhard ct al. 2001; BorricUo, Salucci & Danesc 2003; Trujillo, Burkert &: Bell 2004). For example, 
there is strong evidence that the total mass-to-light ratio increases with luminosity (e.g., Gerhard 
et al. 2001; Bernardi et al. 2003; Borriello et al. 2003; RKK; Padmanabhan et al. 2004), but 
little consensus as to the source of this effect. Gerhard et al. (2001) argue that we are seeing a 
luminosity dependence of the stellar mass-to-light ratio. In contrast, Padmanabhan et al. (2004) 
claim that the effect is due to an increasing dark matter abundance with galaxy luminosity, while 
the stellar populations exhibit negligible variation (see also Kauffmann et al. 2003) . We have tested 
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a non-homologous mass distribution, in which the dark matter abundance fcdm varies with galaxy 
luminosity as 

J cam / 



-If — fcdm — fcdm,* ( r ) ' (-'-^) 
J- ~ Jcdm \ ^* J 

where Vcdm = ^cdmi'^Re) /^lumi'^Re)- Unfortunately, the tests show that our lensing analysis has 
no power to distinguish even the extreme cases of y = (structurally homologous) and x = 
(systematically non-homologous), as we can measure only a combination of parameters x + y. 
Specifically, we obtain a virtually identical value for x if y = as we do for y if x = 0, with no 
difference in the goodness of fit or the favored evolution model. This result is primarily due to the 
fact that lenses measure total masses only. Consequently, homologous mass distributions contain 
all the phenomenology we can probe: namely, the concentration of the mass distribution and the 
dependence of the total mass-to-light ratio on luminosity (rather than the individual behavior of 
either mass component). The evolution estimates are the same under either structural assumption. 

Strict homology is also relaxed if galaxies exhibit structural scatter at fixed luminosity. This 
could be tested by modeling the galaxy population with a statistical distribution of mass profiles, 
much as we will do for star formation redshifts (see §2.3). In practice, however, this is difficult 
to accomplish within the context of our global mass model. For a given galaxy, different mass 
profiles require different stellar mass-to-light ratios, which renders impossible a parameterization of 
the galaxy population in terms of a single value of T*. We choose to forgo the direct modeling of 
profile scatter, as we do not wish to add more nuisance parameters to cloud the analysis. Moreover, 
in §4.2 we show that structural scatter is not necessary to produce a good statistical description of 
the data. Hence, a single profile model satisfies Occam's razor. By ignoring structural scatter, we 
will maximize the range of formation redshifts necessary to account for the scatter observed about 
the AMR relations. 



2.3. Evolution Models 

We account for evolutionary effects by converting each observed galaxy luminosity L{z) to a 
common stellar age, yielding an evolution-adjusted luminosity L^.^. This may be accomplished in 
two ways, depending on the complexity of the star formation model. 

To derive constraints on the mean star formation redshift and evolution rate, we assume that 
the galaxies are described by a single star formation redshift (Model 1). For such a model we can 
normalize the galaxy luminosities at any redshift we desire, and simplicity dictates that we evolve 
them to z = 0: 

logLe^ = logL(O) = logL(z) + e(z), (14) 

where e{z) is the change in mass-to-light ratio out to redshift z. We describe luminosity evolution 
with a pair of models. First, there is a linear model in which e{z) = [d\og{M / L) b / dz\ z. This is a 
useful approximation for older stellar populations, and offers a standard for comparing results from 
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different analyses. Second, e{z) can be set according to the detailed evolutionary tracks of stellar 
population models, which we calculate using the GISSEL96 version of the Bruzual & Chariot (1993) 
synthesis code (hereafter denoted "BC96"). This technique facilitates constraints on the mean star 
formation redshift. A statistic is sufficient to evaluate Model 1. 

Modeling the early-type galaxy population with a single formation redshift or evolution rate 
rarely provides a good statistical description of the data (e.g., vvF; van der Wei et al. 2004). 
Therefore, we also investigate a model in which star formation takes place over a range of redshifts 
(Model 2) . This is the first time that such a model has been applied to a gravitational lens sample. 
As in Model 1, galaxies are formed obeying the self-similarity relation (eq. 12), though in this case 
they will exhibit a range of ages at any given redshift. Because we want to evaluate the galaxies 
at a fixed age, we use a stellar evolution model to convert the observed luminosity to its value at 
formation: Lg^ = L[zf). Note that in this context T* represents the characteristic mass-to-light 
ratio at We consider a model in which star formation takes place between Zf^min and zj^max, 
with uniform probability density in log zf. dP/d\og Zf = constant. The added complexity of Model 
2 requires a likelihood formalism for analysis. 



3. Data and Analysis 

3.1. Lens Sample and Parameters 

We reanalyze the gravitational lens sample of RKK, which includes 22 early-type galaxies and 
bulges with measured redshifts. The raw photometric (total magnitudes and effective radii from 
Hubble Space Telescope observations) and geometric (Einstein radii for ring and quad lenses; image 
radii for doubles) parameters are listed in R03 and RKK, respectively. Assuming an isothermal 
profile, our lens galaxy sample is characterized by a mean velocity dispersion of ~ 240kms~^. 

For each galaxy, we convert the observed magnitudes {mobs,Y in filter Y) into a rest frame 
B-hand magnitude using the procedure outlined in R03. This involves using the BC96 spectral 
evolution code to calculate a spectral energy distribution (BED) as a function of redshift, which 
is then convolved with transmission curves for HST filters (Holtzman et al. 1995). We construct 
synthetic "colors" C{B,Y) = mmod,B — ^mod,Y between rest frame magnitudes in filter B and 
directly measurable magnitudes in filter Y for a galaxy at the appropriate redshift. The rest frame 
magnitude is then 

Ey Kbs.y + C{B, Y)]/{6mobs,Y? nK^ 

Ey ^/[SmobsXr 

where the sum is taken over all filters in which the galaxy has been observed. The model SED 
depends on the star formation history, metallicity, initial mass function (IMF) and cosmology. 
We take as a fiducial model an instantaneous star burst at = 3 with solar metallicity Z = 
Zq and a Salpeter (1955) IMF, along with our standard cosmological parameters. For a given 
spectrophotometric model, errors on the derived rest frame magnitudes tend to be small (< 0.1 
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mag), as most lens galaxies in our analysis sample have good photometry in at least one filter (see 
R03). However, the conversion does depend on the assumed spectral template. We account for this 
in our error budget by considering the scatter induced by a broad ensemble of plausible models 
{1 < Zf < 5, OAZq < Z < 2.5Zq). This results in an additional uncertainty of 0.1 — 0.15 mag in 
the interpolated magnitudes. Following R03 and RKK, we therefore set a uniform error of 0.2 mag 
on the derived rest frame magnitudes, or SlogL = 0.08. We will model these and all subsequent 
errors as Gaussian. 

It is important to note that the assumed value of the luminosity error has a limited effect on 
our analysis. For Model 1, our determination of whether the model is a good statistical description 
of the data is based on the best-fit x^: which of course depends on ^logL. Because a model with 
a single star formation redshift is expected to perform poorly, we obtain constraints on individual 
parameters (using standard Ax^ limits) by rescaling all errors such that the best-fit is equal to 
the number of degrees of freedom. Post-rescaling, the initial value of S log L is of little importance. 
The rescaling procedure yields constraints similar to those derived from a jackknife error analysis or 
bootstrap resampling (each without error rescaling) , as all three techniques are designed to provide 
error bars which are consistent with the observed scatter. For simplicity we will quote only the 
Ax^ results. In Model 2 we seek a good fit to the data in the absolute sense (sec §3.2), so error 
rescaling will not be used. While our parameter constraints do vary with SlogL, our ability to 
derive a statistically consistent model does not. We have tested several values (0.02, 0.04, 0.08) of 
6\ogL, and find that each yields models which provide a good statistical description of the data. 

More important than the assumed value of S log L is that we apply it uniformly to all lenses. 
This ensures that each galaxy is an important contributor to the fit, which allows us to better 
constrain the "mean" properties of the population. As discussed in §5.1, uniform luminosity errors 
should reduce biases in constraining evolutionary models. 

The errors in the radii entering the AMR relations are negligible, and we can safely ignore 
them. Uncertainties in the model-predicted masses or velocity dispersions are dominated by the 
luminosity errors, which are summarized above. Parameters correlated with the effective radius are 
treated as in RKK, though these effects are also negligible. Four of the systems in our sample do 
not have measured source redshifts. For these we assume Zg = 2.0±1.0 and derive uncertainties in 
log according to the Monte Carlo procedures detailed in RKK. Finally, because unconstrained 
quadrupoles effectively smear the AMR relation for doubles, we include an additional tolerance of 
10% in mass (or 5% in velocity dispersion) for such cases. 

3.2. Calculations 

The AMR relations provide constraints on the homologous mass model, or equivalently, on the 
FP. For this purpose we introduce a function to describe the offset of each galaxy from its relevant 
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AMR relation (eq. 1 for quads and rings, eq. 2 for doubles): 




f logM^od(^Em,i) - log(Scr,iVri?|.^^.) rings, quads 

\ ^Og{Mmod{Rl,i)/Rl,i + Mmod{R2,i)/R2,i) - log{T,cr,iTT{Rl,i + i?2,i)) doubleS 
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We constrain Model 1, which postulates a single mass profile and a single star formation 
redshift for the lens galaxy population, by optimizing the goodness-of-fit parameter 



where Ssd is the error rescaling factor. The logarithmic uncertainty SAMR,i on each data point is 
derived using the Monte Carlo methods outlined in §3.1, but is well approximated as S\j^jj^^ ~ 
(1 + ,t)^((5 log L)^ + ((5 log Scr,i)^ + where 6j is the additional 10% tolerance for quadrupole- 
related smearing of the mass-radius relation in doubles. We set 6^ = for quads and rings, and 
S log T,cr,i = for systems with a measured Zs ■ 

Alternatively, we could use the FP approach and evaluate Model 1 using the goodness-of-fit 
function 



where ac,i is the velocity dispersion calculated from the AMR-normalized mass profile, and aFP,i 
is the value predicted from the FP/evolution model (eq. 6). The uncertainty parameter is Spp^i = 
0.5 6am R,i- While this looks like a very different formalism, it is not. Analyzing estimated velocity 
dispersions with the FP (eq. 18) yields numerically identical results to analyzing AMR relations 
with the homology model (eq. 17). Only if the local intercept 70 is known to high systematic and 
statistical accuracy (^ 0.1) does using the FP formalism add any information. However, to make 
use of the local zero-point, one must also know the slopes of the FP and the orbital anisotropy. In 
addition, the velocity dispersion must be estimated in a spectroscopic aperture identical to the one 
used to derive the local FP parameters. We therefore find that the mass homology relation for lenses 
contains all the information of the FP with fewer systematic errors given the direct relationship of 
lens data to mass rather than velocity dispersion. 

Following optimization of the parameters, we set 6sd so that the best-fit model has = ^dof, 
the number of degrees of freedom. This preserves the relative weighting among the data points, 
which naturally gives less weight to doubles and those systems with an estimated Zg , and therefore 
does not alter the optimized values of the model parameters. However, rescaling does allow us to 
relate the uncertainties to the observed scatter in the model, yielding parameter errors which agree 
with those derived from the bootstrap and jackknife techniques (see §3.1). Rescaling is particularly 
important because Model 1 is undoubtedly a simplistic representation of galaxy structure and 
evolution. Hence, the is likely to be dominated by unmodeled complexity (i.e., deviations from 
self-similarity and an intrinsic spread in star formation redshifts) in the galaxy population, rather 
than by observational errors. 




(17) 




(18) 
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Model 2 postulates a homologous mass profile for the lens sample, but allows for star formation 
over a range of redshifts. This adds significant complexity to the analysis. Specifically, while a single 
formation redshift yields a single value of Aj^mr for each galaxy (given a fixed set of structural 
parameters), a distribution of formation redshifts yields a distribution dP/d/S.AMR- Because a 
statistic alone is insufficient for evaluating Model 2, we turn to a likelihood formalism. We evaluate 
the likelihood ln£ = ^^ln£j, where 

and Xamri '^^ defined in eq. (17). The lower limit of integration is Zf^ = max(z/^j„j„, Zj), where 
Zi is the redshift of the ith lens galaxy. Confidence limits on parameters are derived in the usual 
manner for maximum likelihood methods by using differences in \nC There is no rescaling of errors 
in Model 2. Note that the likelihood formalism (eq. 19) should reduce to the formalism (eq. 17) 
in the limit of a single star formation redshift (Model 1). We have confirmed that the techniques 
yield identical parameter constraints in this case, so long as we keep Ssd = 1- 

In addition to finding the best-fit model, we also want to determine whether it is a good 
fit to the lens data in the absolute sense. For Model 1, we simply compare the best-fit (for 
^scl = 1) to the number of degrees of freedom. Model 2, however, requires a more intricate, 
Monte Carlo-based approach in which we select and analyze simulated lens samples, which are 
based on the geometries of our actual lenses. First, we randomly select the formation redshift 
for each galaxy according to the statistical distribution dP/dzf, and set the formation luminosity 
such that the homologous mass model perfectly reproduces the AMR relation. The luminosity is 
then evolved to the measured galaxy redshift, and perturbed according to the assumed luminosity 
errors. Uncertainty in the mass scale due to an estimated source redshift is handled by an additional 
perturbation. We calculate the likelihood of each simulated data set given the model {In Cgim), and 
build up an expected likelihood distribution {dP/dln Csim), which wc crudely parameterize by its 
mean {In Csim) and standard deviation InC^^. If the real lens sample is well described by the 
model, we would expect its likelihood (ln>C) to be consistent with the distribution dP /din Csim- 
We define consistency as |ln£ — (\.nCsim)\ < ln£^^. Finally, we note that these Monte Carlos 
test a necessary - but not a sufficient - condition for compatibility between samples (in this case, 
real and simulated data). 



4. Results 

4.1. Focus on Structure 

We begin by updating the RKK analysis of galaxy structure, taking into account the effects 
of galaxy evolution. To facilitate comparisons with the RKK results, we will derive our structural 
constraints using Model 1, which postulates a single star formation redshift. We assume the linear 
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evolution model, but replacing this with the detailed evolutionary tracks has little quantitative 
effect. In Fig. 1 we plot constraints on the dark matter abundance fcdm stnd inner power-law slope 
n by simultaneously optimizing the other parameters - x, d\og{M / L) b / dz and the normalization 
(70 for the FP, T^, for the homology formalism). Solid lines are the 68% (A^^ < 2.30) and 95% 
(Ax^ < 6.17) confidence limits in two dimensions; dotted lines are the 68% (Ax^ < 1) and 95% 
(Ax^ < 4) confidence limits in one dimension. These and all subsequent values of Ax^ follow the 
setting of the scale factor 5sci such that the best-fit model has x^ = Ndqf- Constraints arc shown 
for three different values of the CDM break radius, Rb/Re = 3, 10 and 50. Smaller values of Rb/Re 
mean that the steep outer slope has a more prominent effect on the radial scales probed by the 
lens sample (0.2 < R/Rg < 7; see RKK). To compensate, the inner slope n must become shallower 
as Rb/Re is decreased. The best-fit models have x^ = 44.1, 43.8 and 44.0 for Rb/Re = 3, 10 and 
50, respectively, corresponding to an rms scatter of 0.16 in log(M/L). Based on our estimated 
uncertainties, this implies an intrinsic scatter of ~ 30%. Note that a of 44 for Ndqf = 17 is 
a very poor fit, suggesting that structural homology and a single star formation rcdshift do not 
provide a good description of the lens data (sec also vvF). This is also the case for the standard FP, 
whose intrinsic scatter is larger than can be explained by measurement errors alone (e.g., Jorgensen 
et al. 1996). 

Our nearly identical values of x^ for the different break radii indicate that our model is over- 
parameterized. For simplicity we will set Rb/Re = 50 for all subsequent calculations, and therefore 
treat the dark matter halo as a power law. Note, however, that this limit is not identical to the 
power-law surface mass M{R) oc i?'^^" which RKK used to describe the halo. Specifically, the RKK 
model does not have a consistent three-dimensional projection for n < 1. As n ^ 1 the two models 
are quantitatively different, with the RKK power law approaching a mass sheet (S = constant), 
and the cuspy model approaching a logarithmic surface density distribution (S oc InR). Hence, for 
a given point {fcdm and n ~ 1) in the parameter plane describing our new model, the projected 
mass profile is slightly steeper than the same point in the RKK model plane. 

Despite our inability to separate the effects of the inner power-law slope, break radius and CDM 
mass fraction, all acceptable models are very close to isothermal on the scale of several effective 
radii. Wc plot the mass profiles in Fig. 1. Our profile constraints agree with those of RKK, who 
assumed a fixed evolution model corresponding to a star formation rcdshift of Zf = 3. By including 
luminosity evolution as a free parameter, however, we allow for a slightly broader range of mass 
profiles. In the limit of a nearly scale-free mass model (T* — > 0, fcdm ~^ 1) Rb/Re ^ l); our 
constraint on the logarithmic density slope is n = 2.06 ± 0.17, compared to the RKK limit of 
n = 2.07 =b 0.13.^ Each constraint is at 68% confidence, or Ax^ < 1. We still detect dark matter to 
very high significance, as a model in which mass traces light is rejected at Ax^ = 9.2 (compared 



^Using Model 2 we obtain n — 2.05 ± 0.11, which is consistent with the resuhs from Model 1. The absence of 
error rescaling in Model 2 (see §3.2) accounts for most of the difference in the profile uncertainty. Consequently, we 
conclude that allowing for a range of star formation redshifts has little effect on the mean mass profile favored by the 
lens sample. 
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to Ax^ = 10.7 in RKK). In summary, uncertainties in the luminosity evolution do not affect our 
conclusions that early-type galaxies are nearly isothermal on the scale of a few effective radii, and 
that a small but non-zero fraction of dark matter {fcdm > 0.08, 95% C.L.) is required in this regime, 
independent of the stellar mass-to-light ratio. 

The zero-point of the FP depends on some combination of the radial mass profile, mass-to- 
light ratio, orbital anisotropy and spectroscopic aperture, as these factors determine the dynamical 
constant cm (see eq. 7 and 8). If the local zero-point 70 has been accurately measured for a 
given set of slopes a and /?, then the FP formalism adds information that is not included in the 
homology formalism, and could therefore improve our model constraints. We refit the intercept 
in our analysis because there is no accurate, independent measurement of 70 specifically for low- 
density environments. However, we can estimate the precision to which 70 must be measured in 
order to affect constraints on our structure/evolution model. In Fig. 2 we show the best-fit value of 
70 as a function of mass profile. For demonstration purposes we assume FP slopes corresponding to 
X = 0.3 (a = 1.25 and (3 = 0.325), and calculate velocity dispersions for isotropic orbits {Piso = 0) 
in an aperture with radius Re/ 8. We find that 70 varies slowly over the parameter plane if the FP 
slopes are fixed. The values vary wildly if x is simultaneously fit. The value of 70 must be measured 
to a precision of <C 0.1 to improve the model constraints. We therefore find that the FP adds little 
useful information beyond that already contained in the AMR relations. 

While we have assumed that early-type galaxies are homologous, constraints on the typical 
mass profile (Fig. 1) show the range over which individual galaxy profiles might reasonably scatter. 
Dynamical studies of individual lenses can directly address departures from homology, and are 
therefore a powerful complement to our statistical analysis. Velocity dispersions have been measured 
for seven of the lenses in our sample: Q2237-I-0305 (Foltz et al. 1992), MG1549-f3047 (Lehar et 
al. 1996), PG1115-h080 (Tonry 1998), HST14176-h5226 (Ohyama et al. 2002; Gebhardt et al. 2003; 
Treu & Koopmans 2004), MG2016-M12 (Koopmans & Treu 2002), 0047-281 (Koopmans & Treu 
2003), and B1608-I-656 (Koopmans et al. 2003). Two of the galaxies are particularly interesting, as 
dynamical measurements would imply that they have mass profiles which deviate significantly from 
isothermal. PG1115-I-080 (Weymann et al. 1980) has a velocity dispersion which is higher than the 
isothermal prediction, indicating a steeper mass profile (Tonry 1998; Treu & Koopmans 2002b). In 
contrast, HST14176-I-5226 (Ratnatunga et al. 1995) has a velocity dispersion which is lower than 
the isothermal prediction (Ohyama et al. 2002; Gebhardt et al. 2003; Treu &: Koopmans 2004), 
indicating a shallower mass profile. Based on the existence of such structural outliers, it has been 
argued that estimated velocity dispersions are insufficient for evolution studies with gravitational 
lens samples. We now show that this will not be a significant concern for our new evolution 
constraints (§4.2). 

We wish to determine the degree to which dynamical measurements of lens galaxies are com- 
patible with the range of mass profiles allowed by our FP/homology analysis. First, at each point 
in {fcdm — n) space, we set the profile normalization of the lens according to its AMR relation. 
We then estimate the velocity dispersion and compare it to the measured value. For our dynam- 
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ical estimates we use a circular aperture with an area equal to that of the aperture in which the 
velocity dispersion has been measured, and assume isotropic orbits {(3iso = 0). We expect our 
comparisons to be accurate at the level of ~ 15kms~^. For each lens, we plot in Fig. 3 contours 
of constant a, along with our global profile constraints. We see that five of the seven lenses are 
nearly isothermal, and are consistent with our 68% confidence limits on the mean mass profile. 
Note that while our constraints on the individual mass profiles are qualitatively similar to those 
derived in the literature, there are small systematic disparities due to the use of differing structural 
parameters (effective radius, image separation, or environmental corrections). We confirm that the 
dynamical measurements of two galaxies require profiles that are formally inconsistent with isother- 
mal. However, the comparison ignores systematic uncertainties in the dynamical model due to the 
derivation of velocity dispersions from Gaussian profiles and the ellipticities of the galaxies, each 
of which will reduce the significance of these deviations (see Kochanek, Schneider & Wambsganss 
2004 for additional discussion). For PG1115+080, Tonry (1998) measures a = 281 ± 25kms-i 
in a I'.'O squared aperture. We find that this galaxy must have a profile which is significantly 
steeper than isothermal: p oc r^^'^, similar to the value favored by Trcu Sz Koopmans (2002b). ^ 
For HST14176+5226, Gebhardt et al. (2003) measure a = 202 it 9kms^^ in a similar aperture, 
which is marginally inconsistent with a previous measurement of cr = 230 ± 14kms~^ by Ohyama 
et al. (2002) and a similar measurement by Treu &; Koopmans (2004). We plot the Gebhardt et 
al. (2003) value because it implies the greatest departure from the mean model. As expected, we 
find that HST14176+5226 requires a profile that is shallower than isothermal, with p oc r^^'^, and 
a rather high dark matter fraction {fcdm > 0.6). The discrepancy persists if we substitute the 
Ohyama et al. (2002) or Treu Sz Koopmans (2004) measurements, though at a reduced significance. 
While both PG1115-I-080 and HST14176-I-5226 appear to have mass profiles which are different 
than isothermal. Fig. 3 shows that the constraints on their profiles are consistent with our 95% 
confidence limit on the lens galaxy population - though only marginally consistent with our 68% 
confidence range. However, if the width of our statistical constraints reflects intrinsic scatter about 
the mean homology relation, then we might expect ~ 1/3 of lenses to be outliers at the level of 
PGl 115+080 or HST14176+5226. This fraction is consistent with the present dynamical sample. 

Consequently, our profile constraints are a sufficient proxy for measured velocity dispersions in 
gravitational lens galaxies. By simultaneously considering structure and evolution, we will derive 
limits on the evolution rate and mean star formation redshift which account for the allowed range 
over which galaxies may depart from a strict homology. 



*Notc that wc have a much smaller effective radius (re — 0'.'5 versus O'.'S), and this accounts for our acceptance of 
the constant mass-to-light ratio model which Treu & Koopmans (2002b) nominally rejects. 
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4.2. Focus on Luminosity Evolution 

We now turn to luminosity evolution. Because of the large degeneracy between the CDM 
slope n and abundance fcdm, we will further simplify the total mass model to the scale-free limit 
(T* 0, fcdm 1) Rb/Re S> 1). Note that this simplification does not have any effect on the 
evolution constraints versus the full mass model, but it does make it easier to display the results, 
particularly with regard to the interplay between mass concentration and evolution. 

4-2.1. Model 1: Single Star Formation Redshift 

Wc begin by analyzing Model 1, which postulates that lens galaxies were formed at a common 
epoch, and therefore follow a common evolutionary track as a function of redshift. This model 
is intended to describe the mean properties of the galaxy sample, and provide a standard for 
comparing results obtained by different groups. We reiterate, however, that this model does not 
provide a statistically good fit to the data. As we show in §4.2.2, including a range of star formation 
redshifts leads to a vastly improved model of the lens sample (see also vvF) . 

In Fig. 4 we plot pairwise constraints on the evolution rate dlog{M/L)B/dz, mass-to-light 
ratio index x, and logarithmic density slope n, which now describes the total mass distribution. As 
we can see from Fig. 4, the parameters in our model are correlated. The anti-correlation between 
X and d\og{M / L) B / dz is easiest to explain, as these two quantities enter the model as a product: 
log M(i?) oc {1 + x)^og L{z) + d\og{M / L) B / dz z] (see eq. 12). This effect widens the favored range 
of x, compared to the constraints derived by RKK for a fixed Zf = 3 evolution model. Specifically, 
we find x = O.lStojt, while the RKK value is x = 0.141°;^^ (both 68% C.L.). The source of the 
other correlations is not clear, though they may be related to the increasing mass scale of the lens 
sample with redshift (R03). 

Optimizing over all structural parameters, we measure an evolution rate of dlog{M/L)B/dz = 
-0.50 ± 0.19 at 68% confidence. The 95% confidence range is -0.86 < d\og{M / L) b / dz < -0.10. 
These constraints are identical for the full mass model and the scale- free limit, and are not signif- 
icantly altered by setting x to reflect locally measured FP slopes. Our results are consistent with 
the findings of R03, who measured dlog{M / L) b / dz = -0.54 ± 0.09 (68% C.L.) from a sample of 
28 lens galaxies under the assumption of isothermal mass profiles. Simultaneously optimizing the 
galaxy structure does weaken constraints on the evolution rate, but not to the point that they are 
uninteresting. The conclusion that lenses favor a slow evolution rate is unchanged. 

We next consider bounds on the mean star formation redshift by using evolution tracks derived 
from the BC96 population synthesis models. Pairwise constraints are plotted in Fig. 5. The 
constraint on the mass-to-light ratio index (x = 0.14lQ'^g, 68% C.L.) is somewhat tighter than that 
derived using the linear evolution rate. The small difference is due to the fact that the real tracks 
do not map into the linear tracks for later formation redshifts; e.g., there is no real track which is 
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well approximated by a linear model for < 1.5. 

The best-fit model has a mean star formation redshift of (z/) ~ 2.2, and the data are consistent 
with stellar ages as old as the universe itself. Coming from the other side, the constraints are far 
more restrictive: (z/-) > 1.8 at 68% confidence, and (z/) > 1.5 at 95% confidence. Note that these 
results are not sensitive to our photometric conversion technique (§3.1). We find quantitatively 
similar bounds if the luminosities are derived from a = 1.5 spectrum, or from the spectra which 
best fit the colors of individual lenses. Excluding the four systems with estimated source redshifts 
slightly weakens the 95% confidence limit to (z/) > 1.4. 

The above evolution constraints are remarkable, considering that we have assumed no specific 
shape for the mass profile of lens galaxies. While we do assume that eaxly-type galaxies comprise 
a regular population, the range of permitted profiles crudely approximates the degree of scatter 
allowed by the AMR relations. We can therefore account for possible departures from homology, 
without directly measuring individual velocity dispersions. Based on this simultaneous analysis of 
structure and evolution, we conclude that, on average, early-type galaxies in low-density environ- 
ments have rather old stellar populations ((zj) ~ 2), similar in age to their cluster counterparts. 



4-2.2. Model 2: Range of Star Formation Redshifts 

We now turn to Model 2, which allows galaxies to form over a range of redshifts: dP/dlog Zf = 
constant between Zf^^in and Zf^^ax- (Substituting a model in which the probability density of star 
formation is uniform over Zf, rather than log 2;/, has little effect on any of our primary results.) 
In Fig. 6 we plot constraints on the star formation model, optimizing over the global mass profile 
and mass-to-light ratio parameters. We have re-mapped the parameter space in terms of {zf^min + 
Zf^rnax)/"^ and {zf^max — Zf^min)/'^, as this better illustrates the physical meaning of the constraints. 
In the main panel (upper left), solid lines are the 68% (Aln>C = —1.15) and 95% (Aln>C = —3.09) 
confidence limits in two dimensions; dotted lines are the 68% (Aln£ = —0.5) and 95% (Aln>C = 
—2.0) confidence limits in one dimension. In subsequent panels of Fig. 6 we overlay contours to 
better describe the distribution of star formation redshifts in the model. Specifically, we plot the 
minimum, maximum and mean formation redshifts, as well as the fraction of stars forming earlier 
than Zf = 2.0 and later than zj = 1.5. 

The lens sample favors models with significant scatter in star formation redshift: Zf^^ax ~ 
^f,min > 0-4 at 95% confidence and Zf^^nax " Zf,min > 1-7 at 68% confidence, each based on one- 
dimensional likelihood differences. The required scatter in Zf increases with mean star formation 
redshift. This can be understood by noting that luminosity evolution depends much more sensitivity 
on Zf when Zf is small. At higher mean formation redshifts, a broader redshift range is required 
to produce the same luminosity scatter for galaxies at z < 1. The nearly linear nature of the 
degeneracy - {zf^max — ^f,min) 1'^ oc {^Zf^min^^f.max)!'^ ~ indicates that the analysis is most sensitive 
to Zf^minj which is consistent with the rapid variation of evolutionary tracks for late formation 
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redshifts. For the lens sample, Zf^min cannot be much smaller than 1. Undoubtedly the high 
redshift {z > 0.5) lens galaxies will dominate this constraint, as they are most sensitive to Zf. In 
contrast, models with much later star formation could be allowed if only low redshift lenses were 
considered. We note that the distribution of star formation redshifts can vary significantly among 
acceptable models of the lens sample. For example, the most favored models {AC < —1.15) allow 
anywhere from to 40% of galaxies to form at Zf < 1.5 (Fig. 6). We therefore conclude that 
low-redshift star is an important component of our model. However, in all of our acceptable models 
the mean star formation redshift is relatively high {{zf) > 1.5). 

It is interesting to note that allowing a range of star formation redshifts leads to a relatively 
modest improvement versus a single-zj scenario, at least from a maximum likelihood perspective. 
For example, we find that the best-fit single-z/ model {zf^min = Zf^max ~ 2.2) is at only AlnC = 
—2.1 with respect to the best overall model. There is, however, a large improvement in terms 
of finding models which provide a good statistical description of the data. As the spread in Zf 
is increased from zero, the likelihood describing the data (ln£) increases slowly, while the mean 
likelihood of data sets drawn from the Monte Carlo {{InLsim)) decreases due to the fact that the 
model is inherently more diffuse. This leads to a rapid improvement in matching the statistical 
properties of the lens sample to that of the models. For example, the best-fit single-zj model has 
a likelihood which is lower than its corresponding {In Csim) by |Aln>C| ~ 12 (this is nearly equal 
to (x^ — Ndof)/^, as expected), and well outside the range described by ln>Cf™. Models with 
a single star formation redshift are thus clearly inconsistent with the statistical properties of the 
data. However, by Zf^rnax — Zf^min ~ 1 we find models for which | ln£ — (}.nCsim)\ ^ lii-^fiTrP- Such 
models meet the necessary condition for statistical compatibility, in the sense that the real lens 
sample may actually be drawn from them. This regime is marked by the shaded area in each panel 
of Fig. 6. 

In conclusion, models in which early-type field galaxies form homologously over a range of 
redshifts offer a vastly improved representation of the actual lens sample. Allowing some fraction 
of galaxies to form at low redshift is an important component of these models, though the mean 
star formation redshift is still relatively high. The best-fit models meet the necessary condition for 
statistical consistency with the lens sample, based on the interpretation of likelihoods. As noted in 
§3.1, altering the assumed luminosity errors does not change this conclusion. 

5. Are Lensing Analyses Consistent? 

5.1. Statistical Results 

The luminosity evolution of gravitational lens galaxies has recently been explored by van de 
Ven et al. (2003), who use the FP to derive an evolution rate of d\og{M / L) b / dz = -0.62 ± 0.13 
from a sample of 26 lenses. This is slightly faster than the findings of either R03 (—0.54 zt 0.09) 
or this paper (—0.50 ± 0.19 for Model 1), even though the lens samples and photometry are nearly 
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the same. Furthermore, vvF claim that rather late mean star formation redshifts are permitted 
((z/) = l.Stli, 68% C.L.), while much of this range is excluded by R03 {{zf) > 1.8, 95% C.L.) 
and the combined structure/evolution analysis of §4.2.1 {{zf) > 1.5, 95% C.L.). While the results 
of RKK and vvF are technically consistent, it is instructive to discuss three factors which account 
for the small differences: data weighting, stellar evolution models, and sample selection. 

First, differences in data weighting account for most of the systematic disparity in evolution 
rates. We set a fixed uncertainty for all galaxy luminosities, ensuring that each lens has a similar 
relative contribution to the fit.^ In contrast, wF use uncertainties which they derive from their 
photometric conversion. Based on their quoted errors on the surface brightness, some galaxies are 
far more important to the fit than other galaxies. Data weighting has a surprisingly large impact on 
the derived evolution rate. vvF already demonstrate this by testing an error term (to account for 
scatter) which is added in quadrature with other errors. This has the effect of weighting the galaxies 
more uniformly, and slows the evolution rate from d\o%{M / L) b / dz = —0.62 ±0.13 to —0.56 ±0.12. 
We have confirmed these results by reanalyzing the data tabulated in vvF. Furthermore, we find 
that the vvF data yield an even slower evolution rate of d\og{M / L)b / dz = —0.46 ± 0.12 (very 
similar to our value) if we impose uniform weighting. The choice of weighting is clearly important 
for constraining the evolution model, and the vvF method is not unreasonable. However, we believe 
that our scheme provides a more realistic description of the global properties of the lens galaxy 
population, since uncertainties in Model 1 are certain to be dominated by intrinsic scatter in the 
mass profile and formation redshift, rather than by measurement errors. Galaxies should not be 
segregated based on nominal photometric errors if those errors are much too small to account for the 
observed scatter in the FP. Moreover, weighting galaxies by their photometric errors can introduce 
a Malmquist bias, which we may be seeing in the vvF analysis. Relatively brighter galaxies are 
expected to have better photometry, giving them greater statistical weight. Since brighter galaxies 
at high redshift will also imply a greater degree of luminosity evolution, the mean evolution rate 
will tend to be biased toward faster values. Nearby lens galaxies, which are bright but have little 
ability to distinguish between star formation models, would also be more strongly weighted than 
the distant galaxies which are vital for determining stellar age through luminosity evolution. 

Second, the constraints on stellar age arc affected by models of the evolution tracks e{z) as a 
function of star formation redshift. We use the tracks derived from the BC96 stellar synthesis code. 
In contrast, vvF use an approximation to the Worthey (1994) simulation results: log(M/L)s oc 
K\og{t — tf), where t — tj is the stellar age and k ~ 0.93. The models agree very well for Zf > 2, 
but the BC96 model evolves more rapidly than the wF fitting function for Zf < 1.5. As a result, 
later star formation redshifts are more strongly disfavored with the BC96 model, which tightens 
our lower bound on {zf). We tested this hypothesis by reanalyzing the vvF data (with their quoted 
uncertainties). For example, while vvF find that their Zf = 1.3 track is marginally consistent with 
their data, the corresponding BC96 track is unambiguously rejected at Ax^ = 12. We believe that 



^Recall that lenses with estimated Zs are weighted down relative to those with measured Zs (see §3.1). 
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the BC96 model facilitates more accurate constraints on the mean star formation redshift, as it 
offers a more detailed treatment of young stellar populations than the wF approximation. 

Third, differences in sample selection have a small systematic effect on the evolution results. 
Systems without measured source rcdshifts are rejected by vvF, but are included in our analy- 
sis (with an estimate of Zg = 2.0 it 1.0). Two of these systems - MG1131+0456 (Hewitt et al. 
1988) and B1938+666 (Patnaik et al. 1992) - are particularly interesting. Each is at high redshift 
{zd = 0.84 and 0.88, respectively) and exhibits a larger than average mass-to-light ratio (especially 
MG1131-I-0456). Hence, they force the model to slightly slower evolution rates. While excluding 
such systems alters constraints on the mean star formation redshift at the level of only ~ 0.1 (see 
§4.2), there is no reason that these lenses should be rejected just because they are outliers. In 
all other respects, they look like typical elliptical galaxies (Tonry & Kochanek 2000). One may 
argue, however, that our choice for the estimated source redshift biases the result. We note that 
the implied mass-to-light ratios of these galaxies can be reduced if the sources are at a much higher 
redshift {zg > 4). Based on the optical properties of the sources, this is indeed a reasonable sce- 
nario. However, we find that these systems have an equally important role in the fit even if we 
substitute "measured" values of Zg = 4.0 for our estimated values of Zs = 2.0 ± 1.0 - while the 
absolute deviation from the global model is reduced, this is almost completely compensated by the 
reduced fit tolerance for systems with a measured Zg. Therefore, the inclusion of these systems in 
our analysis is strongly justified, and their effects on the fit are real. 

Finally, it is interesting to note that R03 and vvF have no systematic differences in rest frame 
magnitudes, despite using alternate techniques for photometric conversion. However, vvF appear to 
have more scatter in their data points, and we trace this to the photometry. Larger scatter slightly 
broadens the vvF confidence regions compared to our own, and therefore leads to a somewhat 
weaker rejection of late formation redshifts. 

5.2. Individual Lenses 

Several studies have focussed on the structure and evolution of individual lens galaxies (Treu 
et al. 2002a, b; Koopmans & Treu 2003; Koopmans et al. 2003; Gebhardt et al. 2003; Treu & 
Koopmans 2004). While these analyses have the benefit of using measured velocity dispersions 
in the fundamental plane relation, there are also two significant drawbacks. First, to determine 
the evolution implied by a single galaxy or a small number of galaxies, one must make an explicit 
comparison against a local FP intercept measured from other samples. This often means using the 
intercept for ellipticals in the Coma cluster to evaluate the evolution of field galaxies, even though 
these populations should not, in general, have identical intercepts at z = 0. Second, such studies 
may be susceptible to a Malmquist bias, which presumably makes it easier to measure velocity 
dispersions in more luminous, higher surface brightness galaxies with younger stellar populations. 
In this case, a sample of lenses with measured velocity dispersions, especially at higher redshift, 
would be biased toward faster evolution rates. Such effects may already be evident in the current 
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dynamical sample. 

The Lens Structure and Dynamics survey (LSD; Koopmans k. Treu 2002, 2003; Treu & Koop- 
mans 2002a; Koopmans et al. 2003; Treu & Koopmans 2004) has measured the velocity dispersions 
of several gravitational lenses, which are combined with HST photometry to constrain structure 
and evolution. Five of these lenses appear in our sample: MG2016+112 (Lawrence et al. 1984), 
0047-281 (Warren et al. 1996), B1608+656 (Myers et al. 1995), HST14176+5226 (Ratnatunga et 
al. 1995), and PG1115+080 (Weymann et al. 1980). As we discussed in §4.1, our statistical con- 
straints on the galaxy mass profile are largely consistent with those derived through dynamical 
measurements and modeling. Note, however, that Treu & Koopmans (2004) now favor a mean log- 
arithmic slope which is significantly shallower and marginally inconsistent with isothermal, based 
on a combined analysis of six lens galaxies. Two of these galaxies do not currently appear in our 
sample, and help weight the LSD analysis toward shallower profiles. 

The LSD sample also favors faster evolution rates and younger stellar populations than our 
results indicate. This may be due in part to a luminosity-related selection bias, or at least small 
number statistics. For example, our evolution results for MG2016-I-112 and 0047-281, based on a 
fixed local FP intercept, agree well with those measured in the LSD survey - but each lens scat- 
ters on the "young" side of our evolutionary trend-lines. Moreover, the LSD survey has observed 
B1608+656, whose spectral properties suggest that the galaxy has undergone significant star for- 
mation triggered by an ongoing merger (Koopmans et al. 2003). Using dust corrections based on 
observed color gradients, they derive a luminosity which is much larger than that of passively- 
evolving galaxies, and a luminosity evolution which is much faster than our estimate for the overall 
sample.^ Based on an analysis of their lens sample, Treu & Koopmans (2004) find an evolution 
rate of d\og{M / L) b / dz = —0.72 zt 0.10, which is systematically faster but still broadly consistent 
with the evolution rate we derive by optimizing over the mass profile. 

Departures from homology can also have interesting effects on the derived evolution rates. If 
we compare the implied FP intercept with a fixed local value, then a larger velocity dispersion 
yields a slower evolution rate, and vice versa. PG1115+080, whose measured velocity dispersion is 
much higher than the isothermal prediction (Tonry 1998; Treu &: Koopmans 2002b), must undergo 
virtually no luminosity evolution to z = 0.31 if it is to fall on the local FP. Alternatively, if the 
galaxy evolves at a standard evolution rate, its structure cannot be consistent with the rest of the 
early-type galaxy population. HST14176-I-5226, in contrast, yields a significantly faster evolution 
rate if the isothermal velocity dispersion estimate is replaced with one of its measured values. 



®The diflerences are exaggerated by the foreground-screen extinction model used by Koopmans et al. (2003) to 
correct the galaxy flux. A foreground screen produces the largest correction, while a more realistic embedded screen 
or a mixture of stars and dust would reduce the luminosity correction (e.g., Witt, Thronson, & Capuano 1992). We 
note, however, that B1608+656 still appears over-luminous in our model. 

'^It is interesting to note that the evolution rate derived by Treu & Koopmans (2004) slows dramatically if the 
local FP intercept is simultaneously fit. 
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Unless the quoted error bars on the velocity dispersions have been underestimated, the findings 
of Gebhardt et al. (2003), Treu & Koopmans (2004) and others suggest that early- type galaxies 
exhibit significant structural diversity. Such deviations from homology warn against using small 
numbers of lenses to constrain an evolution model meant to describe the galaxy population as a 
whole. 

6. Summary and Discussion 

Evolution studies of early-type galaxies based on gravitational lens samples (Kochanek et al. 
2000; R03; vvF) have often been criticized for using estimated, rather than measured, velocity 
dispersions, because a mass model is required to convert the accurately measured projected mass 
into a dynamical estimate. The primary concern has been that the preferred mass model for lenses, 
an isothermal model with a flat rotation curve, could be incorrect. The secondary concern is that 
early-type galaxies may not be well characterized by any single mass distribution that is homologous 
to the luminosity distribution. By simultaneously modeling both the structure and evolution of 
lens galaxies, our present analysis helps address these concerns. Specifically, we determine the 
best-fit mass profile for lens galaxies (as well as the range over which individual lenses might 
scatter), and account for these effects on the mean evolution model. Our technique builds on the 
homologous two-component mass models of RKK, which were constrained using the ensemble of 
aperture mass-radius relations derived from lensed image geometries. The homology formalism is 
virtually identical to the fundamental plane, with the AMR relations substituting for measured 
velocity dispersions. In essence, this allows us to trace luminosity evolution with the FP, while 
directly using the quantity (aperture mass) that strong lensing naturally constrains. 

We first updated constraints on the structure of early-type galaxies, assuming that lenses are 
characterized by a single star formation redshift. We find that uncertainty in the evolution model 
has little effect on our structural results. The FP strongly favors nearly isothermal (n = 2) mass 
profiles on scales of a few effective radii, with a mean density slope of n = 2.06 ± 0.17 (68% C.L.). 
These statistical constraints are consistent with the isothermal paradigm favored by modeling 
(Kochanek 1995; Cohn et al. 2001; Muiioz et al. 2001; Rusin et al. 2002; Winn et al. 2003) and 
dynamical (Treu &; Koopmans 2002a, 2002b; Koopmans Sz Treu 2003; Koopmans et al. 2003) studies 
of individual lens galaxies, the dynamics of local ellipticals (e.g., Rix et al. 1997; Romanowsky & 
Kochanek 2001; Gerhard et al. 2001), and their X-ray halos (e.g., Fabbiano 1989; Matsushita et 
al. 1998; Loewenstein & White 1999). Note, however, that Treu & Koopmans (2004) favor models 
with systematically shallower profiles, while Romanowsky et al. (2003) favor significantly more 
concentrated models. Our analysis indicates that a small but non-zero fraction {fcdm > 0.08, 95% 
C.L.) of the mass projected inside two effective radii must be in the form of an extended dark 
matter halo, independent of any assumed stellar mass-to-light ratio. We also constrain the increase 
in the total mass-to-light ratio with galaxy luminosity (T* oc L^) : x = 0.18]toig for the linear 
evolution model, and x = 0.14^Q^g for the detailed BC96 evolution tracks (both 68% C.L.). These 
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results are consistent with recent dynamical analyses of early- type galaxies (Gerhard et al. 2001; 
Bernardi et al. 2003; Borriello et al. 2003; Padmanabhan et al. 2004). Because the FP is closely 
related to the homology formalism, om constraint on x can be considered the first constraint on the 
slopes of the lensing fundamental plane: 1.1 < a < 2.0 and 0.3 < (5 < 0.4. These constraints are 
not yet very restrictive, but they are consistent with measurements of the local FP (e.g., Jorgensen 
et al. 1996; Pahre et al. 1998a; Bernardi et al. 2003). 

Next we considered the luminosity evolution of lens galaxies, and find a small effect from the 
uncertainties in the mass model. We began by investigating a model in which all galaxies form at 
a common redshift (Model 1). The same assumption has been made in almost every analysis of 
early-type galaxies, and therefore provides a basis for comparing results. The lens sample favors a 
linear evolution rate of dlog{M / L) b / dz = —0.50 ± 0.19 (68% C.L.). This constraint includes the 
spread in structural properties that are consistent with the AMR relations, and is therefore weaker 
than the value derived by R03 based on isothermal profiles, d\og{M / L) b / dz = — 0.54± 0.09 (68% 
C.L.). Assuming the standard cosmological parameters and a Salpeter IMF, we require a mean star 
formation redshift of {zf) > 1.8 at 68% confidence, and (zf) > 1.5 at 95% confidence, based on the 
BC96 spectral population models. These constraints are remarkable, considering that we make no 
assumption regarding the shape of the galaxy mass distribution or the slopes of the fundamental 
plane. Our only assumption is that these galaxies are structurally homologous, but even this is not 
restrictive: we obtain identical conclusions if the increase in mass-to-light ratio with luminosity is 
modeled as a systematic non-homology. We favor slightly older stellar populations in lens galaxies 
than van dc Ven et al. (2003), who suggest that lenses could be consistent with mean star formation 
rcdshifts as late as (zf) 1.2. We have traced these differences to data weighting, evolution models 
and sample selection. In conclusion, our single-z/ analysis again indicates that, on average, early- 
type galaxies in low-density environments have old stellar populations - perhaps as old as their 
counterparts in rich clusters. 

Model 1 does not provide a good fit to the data given the estimated statistical uncertainties, 
much as the scatter in the standard FP also exceeds measurement errors. This suggests that 
additional sources of complexity have not been properly taken into account. In particular, we can 
be confident that galaxies are neither perfectly homologous nor form at a common redshift. The 
scatter about our best fitting solution corresponds to a spread of approximately 30-35% in some 
combination of mass (deviations from homology) and luminosity (range of star formation epochs). 
For our overall constraints on the mean structure and the mean star formation redshift, we included 
these additional uncertainties in our error estimates. 

Wc explored one source of this scatter using the seven lens systems in our sample with direct 
velocity dispersion measurements. For each system we can estimate the parameters of the mass 
model needed to reconcile the velocity dispersion with the aperture mass constraint. In general, all 
seven are consistent with our statistical results, given their mutual errors. This is just a rephrasing 
of the fact that our error estimates in Model 1 have correctly included the scatter created by 
deviations from homology or a spread in formation epochs. Four of the seven lenses have parameters 
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and parameter degeneracies similar to our statistical models. PGl 115+080 requires a steeper mass 
distribution, while HST14176+5226 requires a shallower mass distribution. However, systematic 
errors in measuring and modeling stellar velocity dispersions might significantly mitigate these 
discrepancies. Q2237+0305, the bulge of a nearby spiral galaxy (Huchra et al. 1985), should 
probably be described by a more complex mass model, but an isothermal profile can still describe 
its inner dynamics. 

We also investigated a model which can directly account for the observed scatter in the ho- 
mology relation by allowing a range of star formation redshifts. Specifically, we assumed that stars 
form between Zf^jnin and Zf^^nax-, with uniform probability density in log 2;/, and that the galaxies 
are structurally homologous at formation. By ignoring any intrinsic distribution of mass profiles, 
we expect to maximize the range of formation redshifts necessary to reproduce the observed scat- 
ter. Applying a likelihood analysis to the lenses, we find that a significant spread in formation 
redshift is favored: {zf^max — Zf^min > 0.4 at 95% confidence, and Zf^max — Zf^min > 1-7 at 68% 
confidence), while the single-z/ model is ruled out. Moreover, simulated data sets created from a 
subset of the acceptable models yield likelihoods similar to that of the real data. Allowing a range 
of formation redshifts therefore provides a vastly improved statistical description of the lens sample. 
The required scatter in Zf increases almost linearly with the mean star formation redshift. This 
can be understood by noting that luminosity evolution depends much more sensitively on Zf when 
Zf is small. We find that the distribution of star formation redshifts can vary significantly among 
acceptable models of the lens sample. For example, the most favored models {AjC < —1.15) allow 
anywhere from to 40% of galaxies to form at Zf < 1.5 (Fig. 6). We therefore conclude that the 
stars in early- type field galaxies form over a substantial range of redshifts, and that some fraction 
of these stars may have formed as recently as Zf ~ 1. Our analysis bolsters the findings of Treu et 
al. (2001, 2002), van de Ven et al. (2003), Koopmans et al. (2003), and van der Wei et al. (2004), 
who argue that late star formation in some fraction of field ellipticals is essential for explaining 
their evolutionary history. However, we depart from a few recent claims (Treu et al. 2001, 2002; 
Gebhardt et al. 2003) that the mean star formation redshift of field ellipticals may be as late as 
Zf = 1. In all of our statistically acceptable models, the mean star formation redshift is relatively 
high {{zf) > 1.5). 

Understanding the star formation history of early-type field galaxies remains a work in progress. 
There is, however, growing consistency among results based on colors (Menantcau et al. 2001; vvF; 
Bell et al. 2004), spectral properties (Bernardi et al. 1998; Schade et al. 1999; Kuntschner et al. 
2002; Treu et al. 2002) and the fundamental plane (Kochanek et al. 2000; van Dokkum et al. 2001; 
Treu et al. 2001, 2002; Rusin et al. 2003; van de Ven et al. 2003; van Dokkum & Ellis 2003; van 
der Wei et al. 2004; Treu & Koopmans 2004; this paper). Taken together, these analyses present 
a strong case that field ellipticals formed the bulk of their stars between Zf = 1.5 and 2.0, and 
are therefore not much younger than their counterparts in high-density environments. This largely 
disagrees with the predictions of semi-analytic CDM models (e.g., Baugh et al. 1996; Kauffmann 
1996; Kauffman & Chariot 1998; Diaferio et al. 2001). Moreover, based on the internal scatter of 
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the various samples, it appears that the stars in field ellipticals are likely to have formed over a 

range of rcdshifts, with some fraction of the stars forming as late as zj ~ 1. The degree to which 
these galaxies experience late star formation is a subject of continuing debate, and several of the 
above results disagree on the details. 

What is the source of these remaining discrepancies among analyses of early-type field galaxies? 
Sample selection is probably the best starting hypothesis. Each of the above evolution studies 
employs diflFerent criteria for selecting early- type galaxies. For example, Treu et al. (2001, 2002) 
use morphological/magnitude/color cuts, van Dokkum &; Ellis (2003) use morphological/magnitude 
cuts, and Gebhardt et al. (2003) use the bulge fraction. The lens sample, by contrast, is selected 
on mass, with large velocity dispersion ellipticals dominating the lensing optical depth. The mass 
selection makes the gravitational lens sample unique, as it is the only one not to be affected by 
Malmquist biases related to luminosity, color, or other star formation signatures. However, lensing 
also selects more massive samples of ellipticals. Assuming an isothermal profile, the 22 lenses 
we analyzed have a mean velocity dispersion of ~ 240kms~^, about 10 — 25% higher than any 
of the other samples listed above. The mean velocity dispersion of lens galaxies also increases 
with redshift, due to angular selection effects (R03). Because there is evidence to indicate that 
more massive early- type galaxies may be older (van der Wei et al. 2004), the higher mean velocity 
dispersion of the lens sample could partially account for its slower evolution rate. Additional work 
is clearly necessary to quantify the benefits and biases of the various samples. 

Whatever the remaining uncertainties, the lens galaxies are an extraordinarily powerful probe 
of galactic structure and evolution. Using only 22 lenses, we can simultaneously constrain the 
mean mass profile of early- type galaxies, the dependence of the mass-to-light ratio on luminosity, 
and the rate of stellar evolution, with statistical and systematic uncertainties that are competitive 
with all other methods. The power of gravitational lenses is a result of the aperture mass-radius 
relations derived from the image geometries, but good photometry and measured redshifts are 
needed to make use of these constraints. Follow-up imaging and spectroscopy is therefore vital 
to expanding the current lens sample, which may soon enable significantly improved statistical 
analyses. Such future studies could investigate lenses as a function of color, velocity dispersion, 
environment or spectral properties. In addition, a sufficiently large sample of lenses could probe 
not only luminosity evolution, but structural evolution as well (e.g., Mao &: Kochanek 1994; Rix et 
al. 1994; Ofek, Rix &: Maoz 2003), thereby tracing the merger history in low-density environments. 
Detailed dynamical studies of individual lenses, such as those being carried out by the LSD survey, 
are also vital, as they can directly map the diversity of the lens galaxy population. Similar physical 
constraints can be obtained using time delay measurements in gravitational lenses, if we can safely 
assume that the Hubble constant is accurately measured. With a known Hubble constant, time 
delays measure the surface mass density near the lensed images (Kochanek 2002), which would 
break the model degeneracies for individual lenses in the same manner as a velocity dispersion 
measurement. Models for the time variability created by the microlensing of lensed images can 
also help break the degeneracy because the light curves can be used to determine the fraction of 
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the surface mass density in the form of stars (Kochanek 2004; see also Schechter & Wambsganss 
2002 for a discussion of statistical constraints based on flux ratio anomalies). Because of their 
unique properties, gravitational lenses will continue to contribute significantly to our knowledge of 
early-type galaxies, and provide a testing ground for galaxy formation theories. 
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Fig. 1. — The structure of lens galaxies. All calculations assume a single star formation redsliift 
(Model 1), but there is little change if we allow for a range of star formation redshifts (Model 2). 
Top: Constraints on the shape of the mass profile, which is set by the CDM mass fraction inside 2 
effective radii (fcdm), and the inner logarithmic slope of the CDM halo {n, with density p oc r^"). 
The halo is modeled with a cuspy profile (Munoz et al. 2001), which breaks to an outer logarithmic 
slope of 3 at a characteristic radius Rj,. We show constraints for models with Rb/Re = 3, 10 and 
50, where Re is the optical effective radius. Solid contours represent Ax^ = 2.30 and 6.17, the 68% 
and 95% confidence levels for two parameters. Dotted lines represent Ax^ = 1 and 4, the 68% and 
95% confidence levels for one parameter. The errors have been rescaled so that the best-fit model 
has = Ndof- Bottom: The mass profiles of models favored at 68% confidence. Solid lines are 
the projected masses inside R/R^. Profiles are normalized to a fixed projected mass at R = 2i?e- 
For comparison we show the dc Vaucouleurs profile (dotted line), and an offset isothermal profile 
(dashed line). While the allowed models exhibit a wide range of dark matter abundances and break 
radii, they all have total mass profiles which are approximately isothermal over the radial range 
spanned by the lensed images. 
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Fig. 2. — The recovered local FP intercept 70 as a function of mass profile, for h = 0.65. We assume 
isotropic orbits {Piso = 0) and a cuspy dark matter halo with Rb/Re = 50. Velocity dispersions are 
estimated in an aperture with radius Re/8. The FP slopes are fixed at a = 1.25 and (3 = 0.325, 
corresponding to x = 0.3. Solid lines are contours of best-fit 70. Dotted lines are the 68% and 
95% confidence limits on the model (in two dimensions). These limits are slightly different than 
those shown in Fig. 1 because we have fixed the value of x. We find that the local FP intercept 70 
varies slowly over our parameter space, and must therefore be measured to a precision of <C 0.1 to 
improve the model constraints. Bender et al. (1998) measures 70 ~ —9.0 for a = 1.25 and /? = 0.32. 
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Fig. 3. — Velocity dispersions for lens galaxies. Solid lines are contours of constant velocity disper- 
sion, estimated in circular apertures with area equal to that of the respective apertures in which 
they were observed. Dotted lines are the two-dimensional model constraints for Rij/Re = 50, shown 
in Fig. 1. The shaded areas are the measured velocity dispersions (68% C.L.). As previously noted, 

the dynamics of two galaxies may require profiles which are significantly different than isothermal. 
PG1115+080 is steeper, while HST14176+5226 is shallower. Substituting the higher Ohyama et al. 
(2002) or Trcu & Koopmans (2004) velocity dispersion for the Gcbhardt et al. (2003) value would 
bring HST14176+5226 into better agreement with our statistical results. 
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Fig. 4. — Constraints on the structure and evolution rate in Model 1. Plotted are pairwise con- 
straints on the linear evolution rate d\og{M / L) b / dz , mass-to-light ratio index a;, and logarithmic 
density slope n for a scale-free total mass distribution. Contours are drawn as in Fig. 1. 




Fig. 5. — Constraints on the structure and mean star formation redshift {zf) in Model 1. The 
panels are analogous to those in Fig. 4. Horizontal dotted lines mark (z/) = 1.3, 1.4. ..1.7. The lens 
sample favors old stellar populations, with {zf) > 1.5 at 95% confidence. 
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Fig. 6. — Top left: Constraints on the detailed star formation history of lens galaxies (Model 2). 

Stars form over an interval spanning Zfjnin to Zf^maxi with uniform probability density in \ogZf. 
Confidence limits are plotted using solid and dotted contours as before, and are calculated from 
likelihood differences. We have optimized over all other model parameters. Significant scatter in 
Zf is favored, and the single-zj scenario is excluded. Models which match the statistical properties 
of the actual lens sample are shaded in each panel. For these models, the lens data are consistent 
with the likelihood distribution determined from simulated data sets. The remaining panels show 
contours illustrating the distribution of star formation redshifts. Here our previous constraints are 
shown as dotted contours. Top center: Minimum Zf. Top right: Maximum Zf. Bottom left: Mean 
Zf. Bottom center: Fraction of galaxies forming at Zf > 2.0. Bottom right: Fraction of galaxies 
forming at Zf < 1.5. 



